Recognition of Character Strings Printed with Large Alignment Error

نویسندگان

  • Minenobu Seki
  • Toshikazu Takahashi
  • Takeshi Nagasaki
  • Hiroshi Shinjo
  • Katsumi Marukawa
چکیده

Optical character reader (OCR) technology for reading documents, such as monetary transaction documents, is becoming more and more important than ever before.The position of the printed character string is sometimes largely shifted from its designated position, and there may be two or more directions in spite of one sheet of paper. There are various causes, the performance of the printer, a mistake in the printing position designed by the software, a variation in the cell positions caused by the publishers and by the publishing dates, or even a mistake of the handwriting position. We developed a recognition method for determining which character strings are to be read in such difficult situations. A method for determining the correspondence of character strings to cells with a very high success ratio was developed. This method is based on local and global rules, and effective control of these rules. It was estimated to have a 99.2% success ratio using an experiment on 11,387 character strings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Modfied Self-organizing Map Neural Network to Recognize Multi-font Printed Persian Numerals (RESEARCH NOTE)

This paper proposes a new method to distinguish the printed digits, regardless of font and size, using neural networks.Unlike our proposed method, existing neural network based techniques are only able to recognize the trained fonts. These methods need a large database containing digits in various fonts. New fonts are often introduced to the public, which may not be truly recognized by the Opti...

متن کامل

Recognition of Isolated Multi-Oriented Handwritten/Printed Characters using a Novel Convex-Hull Based Alignment Technique

Handwritten character recognition is one of the difficult tasks of pattern recognition due to diverse writing styles. The problem becomes more severe if the characters are written in a cursive fashion with varying orientations. Also there may exist printed characters of different shapes/fonts and sizes in a document image. In the current work, we have presented a novel convex hull based alignme...

متن کامل

Chip Refinement Character Recognition Text Clean - up I 2 Segmentation Texture Segmentation Texture Segmentation Texture Segmentation Texture Generation

There are many applications in which the automatic detection and recognition of text embedded in images is useful. These applications include multimedia systems, digital libraries, and Geographical Information Systems. When machine generated text is printed against clean backgrounds, it can be converted to a computer readble form (ASCII) using current Optical Character Recognition (OCR) technol...

متن کامل

A method for connected hand-printed numeral recognition using hidden Markov models

A method for the recognition of hand-printed numerals using hidden Markov models is described. The method involves the representation of 2D images of a character with two 1D models, one for the pixel columns of the image and the other for the rows. Various normalisations are applied to both the training and test data to reduce variations between characters within a class, resulting in a corresp...

متن کامل

An Optical Character Recognition System from Printed Text and Text Image using Adaptive Neuro Fuzzy Inference SystemAn Optical Character Recognition System from Printed Text and Text Image using Adaptive Neuro Fuzzy Inference System

This is the age of digital systems. Now a days, everything is being computerized. Peoples are using mobile phones, laptop, computer, camera, notebook, pdf reader etc digital systems too much than ever. Use of papers and pen, printed books are decreasing. Rather peoples are using digital means of communication, study, documentation. Optical character recognition is an application of these digita...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005